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LANUGAGE SENSITIVE ELECTRONIC MAIL GENERATION AND 
ASSOCIATED APPLICATIONS 

RELATED APPLICATION 
This application claims priority to U.S. Provisional Application number 
60/164,585 entitled "System and Method for Obtaining and Collating Survey 
Information in Real Time For Multiple Languages and Multiple Character 
Encodings", filed on November 10, 1999, which is hereby fully incorporated by 
reference. 

FIELD OF THE INVENTION 
This invention Is in the field of infonnation processing. More specifically 
this invention has to do with the customization of electronic mail messages. 

BACKGROUND OF THE INVENTION 
When dealing with electronic mail communications that occur across 
language boundaries, there are two issues that come to the fore. The first is 
the ability of the computers to handle various languages. The second is the 
ability of the users of the computers to handle various languages. 

Computers in general, and electronic mail more specifically, were 
nurtured in an environment that was largely centered around the English 
language. Resultantly early efforts in defining a set of characters to be used by 
computers were based on English. The first set of characters developed was 
the US-ASCII character set. This character set could be represented with 7 
bits of data. A side effect of this 7 bit representation was the development of a 
mail transfer protocol called Simple Mail Transfer Protocol which supports only 
7 bit data. The need to support the European languages, and the additional 
characters, relative to English, defined therein, caused a need for 8 bits of data 
to be used in representing an entire character set. The need to support Asian 
languages, with the large number of characters present in these languages, 
caused the required number of bits for representing these languages to grow 
such that 2 octets (8 bits/octet) were used to represent these character sets. 
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and from this need developed yet more character sets. Although significant 
progress has been made in recent years to evolve to a unified standard, the 
fact remains that there are still various user agents that can only support one 
character set. The absence of a ubiquitous standard among user agents 
results in a variety of character set encoding be used to display text in a way 
that is appropriate for each language. Failure to match the proper character 
encoding with the conrespondlng text can render the Infomnation unviewable. 

Typically, companies wishing to communicate with their customers will 
do so in a generic fashion. In such a case, the result is that every user of a 
company's system will be treated the same in an electronic communication 
from a company. A person in Poland will receive the same communication as a 
person in Japan. This can be undesirable in that communications are very 
language and culture sensitive. A generic message that is sent to all users of a 
system may be perfectly suited for one culture and alienate another culture to 
the point of hurting business In that country. 

In a similar vane, there are times when it is desired to have infonnation 
fed to the company. When things are sent to a representative of a company, 
one runs into a potential for the same cultural insensitive as was alluded to with 
respect to the generic emails. 

There Is a need to have a electronic mail system that can handle the 
disparate needs of various languages and cultures. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 - Typical electronic mail passage through the Internet 
Figure 2 - Typical electronic mail compilation stage. 
Figure 3 - System generation of electronic mail message 
Figure 4A, 4B - Examples embodiment of client databases 
Figure 5 - Embodiment to write correct header information for proper 

language haruJIing. 
Figure 6 - Partial character set for ISO-8859-1 (Latini ) 
Figure 7 - Partial character set for ISO-8859-7 (Greek) 
Figure 8 - Embodiment of a website with selection for of a language. 
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Figure 9 - Portion of an embodiment to write an electronic mail header 
witii customized electronic mail recipient based on users 
language. 

Figure 10 - Routine to determine customized recipients electronic mall 
address. 

Figure 1 1 - Embodiment to create culturally specific salutations 
Figure 12 - Code example for database lookup for language specific 
title. 

Figure 13 - Exemplary database of lists containing culturally specific 
titles. 

SUMMARY OF THE INVENTION 
Emails that are generated, as part of an automated or semi-automated 
process, are to be language sensitive. It is possible, by detennining the 
prefenred language of the user of a company's information services, to 
customize the communication to the user, or in the case of a user using a 
server generating electronic mail message to a company, to customize 
messages to the company, based on users chosen language or an 
automatically determined language. This customization affects two levels of an 
electronic mail message. The first level of customization is the header 
information provided to the electronic mail message to allow proper 
interpretation of the electronic mail message on the way to and once received 
at the receiving end. In one embodiment of the present invention, the MIME- 
Version, Content-Type, and Content-Transfer-Encoding are all customized 
based on the appropriate language. The second level of customization is to 
customize information to be written to the addressee section of the body of the 
electronic mail message to improve user perception of a generated email. In 
one embodiment of the present invention, the recipient of an electronic mail is 
expressed in a manner dependent on the chosen language. In one 
embodiment of the present invention, the salutation to a recipient of an 
electronic mail message is customized based on the chosen language. 
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DETAILED DESCRIPTION OF THE tNVENTION 
In the foilov^ng description, various aspects of the present invention will 
be described. However, it will be apparent to those skilled in the art that the 
present invention may be practiced with only some or all aspects of the present 
invention. For purposes of explanation, specific numbers, materials and 
configurations are set forth in order to provide a thorough understanding of the 
present invention. However, it will also be apparent to one skilled in the art that 
the present invention may be practiced without the specific details. In other 
instances, well known features are omitted or simplified in order not to obscure 
the present invention. 

Parts of the description will be presented in terms of operations 
performed by a processor based device, using temis such as data, tables, 
requesting, determining, retrieving, displaying, and the like, consistent with the 
manner commonly employed by those skilled in the art to convey the substance 
of their work to others skilled in the art. As well understood by those skilled in 
the art, the quantities take the form of electrical, magnetic, or optical signals 
capable of being stored, transferred, combined, and otherwise manipulated 
through mechanical and electrical components of the processor based device; 
and the term processor include microprocessors, micro-controllers, digital 
signal processors, and the like, that are standalone, adjunct or embedded. 

Various operations will be described as multiple discrete steps in turn, in 
a manner that is most helpful in understanding the present invention, however, 
the order of description should not be construed as to imply that these 
operations are necessarily order dependent. In particular, these operations 
need not be performed in the order of presentation. Further, the description 
repeatedly uses the phrase "in one embodiment", which ordinarily does not 
refer to the same embodiment, although it may. 

Overview 

The present invention deals with two main issues. First is the automated 
insertion in an electronic mail, a header including language information to aid in 
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the transfer of the electronic mail message to the receiving end user agent. 
Second is the writing of infonnation to the addressee section of the body of the 
electnanic mail message in a language dependent manner to provide a more 
user-friendly experience to the users (the sender as well as the recipient). 

Figure 1 shows a typical route of an electronic mail message as it is sent 
via the Internet. The user agent 101 is an application typically run by a user. 
The user agent will be used to compose and send the outgoing message. The 
transfer agent 120 is used to transfer the message to the message's final 
destination through networking fabric 130. The networking fabric 130 is 
comprised of switching/routing devices known in the art. The delivery agent 
140 is used to place the message into the end users mailbox (not shown). The 
end user will then use a user agent 150 to view the message. 

An electronic mail message can be viewed as consisting of two parts. 
The first part is the header. The header contains various "internal" fields that 
impart information about various aspects of the electronic mail message. 
These aspects of an electronic mail message typically include: the electronic 
mail address of the sender, the electronic mall address of the recipient, the 
subject, the time of creation and delivery information. There is a particular 
fomnat for each header entry. Each entry in the header is of the form "keyword: 
value". There is always a blank line between the header and the body of the 
electronic mail message. The contents of the header are provided mainly by 
the user agent 110 that is helping the user create the electronic mail message. 
Some elements are provided by the user to the user agent for putting into the 
proper header format in the electronic mail message. 

The second part of the electronic mail message is the body of the 
electronic mail message, including typically an addressee section and a 
message text section. This is typically provided by the user. The user will 
typically have a user agent with some kind of text editing capability. The user 
will type in the body of the message using this text editor. More detailed 
information on the fundamentals of electronic mail messages can be read in 
Request For Comment (RFC) 822. This document is incorporated herein by 
reference. 
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User agents 101 and 150 typically execute on respective user computer 
systems, while transfer and delivery agents 120 and 140 typically execute on 
respective servers. Examples of suitable user computer systems include palm 
sized personal digital assistants, notebook size computers, and desktop 
computers, as well as set top boxes (with storage medium storing the 
programming instmctions implementing user agents 101 and 150 and at least 
one processor coupled to the storage medium to execute tiie programming 
instructions), available from manufacturers such as Palm Computing, of San 
Jose, CA and Dell Computer of Austin, Texas. Similarly, suitable servers 
include a wide range of servers available from manufacturers such as IBM of 
Armonk, NY and Sun Microsystems of Menio Park, CA. 

Figure 2 shows a typical method for composing electronic mail 
messages. In the typical flow, the user agent will utilize three type of 
infonnation to generate a complete electronic mail message. The first type of 
Infomiation that the user agent 240 will need in order to create header entries 
In the electronic mail message is "use-specific" infomiation 210. The "use- 
specific" Infonnation 210 will vary, in most instances, with every electronic mail 
message. A typical example of what "use-specific" infonnation 210 would be 
the recipient's electronic mall address. The user agent 240 will take the 
recipient's electronic mail address for this message and write the proper field to 
the electronic mall message header. 

The second type of information a user agent 240 would typically look to 
is either user specified global (e.g. across many electronic mail messages) 
information or default (if there is none specified) or system information 220. 
Examples of this type of infonmation would be the date and time stamp of the 
electronic mail. This second type of infonnation will make up the remainder of 
the header Information. 

Finally, the user agent 250 will typically write the body (including the 
addressee section) as provided 230 by the user to the user agent 250. 

Increasingly electronic mail messages are automatically generated as 
part of a larger application system. In one embodiment of the present invention 
shown in Figure 3, the server 320 is providing an advertisement to a number of 
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prospective clients. In this case, the server 320 is acting as both the user 
agent and the first transfer agent. Acting in its capacity as a user agent, the 
sender will generate the electronic mail message to be transfenred. In this 
embodiment of the invention, the electronic mail message will be generated 
from two sources of infonnation. The first source is an advertisement from an 
advertisement database 330. In one embodiment of the present invention, the 
advertisement databases will have separate advertisements for each language 
supported by the present invention. The second source will be infonmation 
about the prospective client from a prospective client database 310. In one 
embodiment, the server of an automated system can read information about a 
user's language from the client database 330. From this information about the 
user, the server can then choose from the selection of different language 
versions of the advertisement in the advertisement database, the appropriate 
version to be sent to the users. This advertisement can then be written to the 
body of an electronic mail message. 

Figure 4A shows the contents of a typical prospective client database 
310 used in the embodiment of Figure 3. In this embodiment of the present 
invention, the database will contain a recipient's electronic mail address. 
Additionally, we see that there is an indication of the electronic mail recipients' 
language of choice. For example, we see that user of the system whose 
electronic mail address is prospectiveClient@ispToday.com 410 has a 
language preference of Korean. 

Proper encoding 

As will be understood by those skilled in the art, this discussion assumes 
some knowledge of the standards as set forth in the documentation for the 
Multipurpose Internet Mail Extensions (MIME). The Network Working Group's 
RFCs 2045-2049 define this standard and these documents are Incorporated 
herein by reference. 

It is desirable to make sure that a recipient will be able to read an 
electronic mall message. To that end, the present invention is concerned with, 
among other things, the automatic customization of header information, in an 
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electronic mail message, to allow the message to be passed tfirough to the 
recipient correctly. 

Assume, for example, that an advertisement is customized to a 
particular language, say German. Further, lets say that the advertisement is 
written in text/plain and is written using the iso-8859-1 character set. In such a 
case, if an electronic mail message containing this advertisement is sent via the 
Internet, without the proper header infomiation, it may well be ill received at the 
recipients end. This is due to the fact that iso-8859-1 , an 8 bit character set, 
will be interpreted, by default, as a 7bit US-ASCII character set. So, for 
example, when this 8 bit data is received, by an a transfer agent running 
Simple Mali Transfer Protocol (SMTP), which restricts electronic mail 
messages to 7bit US-ASCII data, reliable and consistent handling of the 
message is not guaranteed. 

in the present invention, when an electronic mail is generated 
automatically, it is desirable to be able to customize the electn^nic mail 
message to the recipient to avoid the aforementioned problems. Figure 5 
shows a section of code, from one embodiment of the present invention, 
designed to write the proper header information to an electronic mail message 
based on a language identifier. 

Referring to the embodiment shown in Figure 4B, we see a list of 
electronic mail addresses that represent purchasers of defective tires that have 
been recalled. In this embodiment, there exists a special electronic mail 
message notification of the recall. The message exists in various languages in 
a database. Based on the recipients preferred language, a message is sent in 
their language with the proper heading information. Take for example the first 
entry in the database shown in Figure 43. In this case, the electronic mail 
address is recipient1@anlSP.com. In this embodiment, we see additional 
infomnation is provided. For example, we see that the user whose electronic 
mail address is recipient1@anlSP.com has a preferred language of Portuguese 
as indicated by the identifier PR. Finally, we see that this database tracks the 
user's name. 
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Figure 5 shows an embodiment of a routine to aid in the writing of a 
correct, language sensitive electronic mail. In one embodiment, the database 
shown in Figure 4B can be used by the mail_header routine of Figure 5. The 
mail_header routine is called to write the appropriate header information to the 
electronic mail message header. 

The electronic mail address 410 and language fields 420 are parsed by 
the server and stored globally in the variables $contact_email and Slanguage, 
respectively, prior to the call of the maiLheader routine. Additionally, in this 
automated environment, there will be another routine that will access tiie 
database containing the message to be sent. The database will have a 
representation of the message in all languages supported by the server. 

The first step in this routine is to assign the recipient and sender 
addresses. In this case, the sender is an account called 
advertise@globaltestmarket.com. The recipient is simply determined by the 
$contact_email variable 510. For the first candidate from the example file of 
Figure 4, we have a Slanguage = PR and $reclp = recipient1@anlSP.com. 

The next step is to get the language specific subject line, in one 
embodiment, the subject line will be ascertained by the call to the routine 
get_message 520. Sent along to the call to the get_message routine, as 
shown in Figure 5, is ttie field identifier of the subject line (57) and the language 
of interest (Slanguage). In this example, the Slanguage would be used to fetch 
the proper message fi^m the database and the field identifier would allow 
proper indexing into the message to determine the subject line in the specific 
language. The get_message routine will return a string containing the subject 
of the message in the appropriate language for the recipient. 

In the embodiment shown, the processing of the specific header 
information to be placed in the electronic mail message is begun at this time. 
This is handled by the "if statement shown. The language for the first entry 
being Portuguese (PR), the if-statement will test true in the first test condition. 
The first line will open an output pipe to $mail_prog. In this embodiment, the 
mall program used Is Sendma// V8.8 running on a Linux OS. One skilled in the 
art will understand that the transfer agent utilized and the operating system 
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Upon which this invention is implemented are not relevant to the novelty of the 
present invention. In this example, since the message will be transmitted using 
an 8 bit character set, the code will infomi sendmail thai it is to use the 8 bit 
extension to SMTP (ESMTP) for this message. This Is done via the 
8BITMIME" option to sendmail. In the next step, information about the sender, 
receiver, subject and reply-to are written to the electronic mall header 550. 

Finally for this embodiment are the three fields that are used to 
specifically identify the information required to accommodate differences in 
character sets and MIME types. The MiME-Version header field 560 will be 
written to infomi transfer, delivery and end user agents of which version of the 
Internet message body fomiat standard to which this message complies. This 
is the chief header entry to inform the agents how to interpret the other header 
fields. The next field is the Content-Type field 570 and its associated character 
code set {charset) parameter. In this example, it defines the content type as 
text. The subtype is defined as HTML. The charset parameter infomris the end 
user agent how to translate the character encoding In this message. Further 
clarification on the character set encoding issue is required. 

Figures 6 and 7 show the character encoding for two different character 
sets. Figure 6 shows the a partial character set for ISO-8859-1 (Latin 1) and 
Figure 7 shows the corresponding portions of the character set for ISO-8859-7 
(Greek). Character sets provide a way of determining what a given octet will 
represent graphically. The two figures shown show the encodings for AO (1 60 
decimal) to FF (255 decimal) for two different types of encodings. As shown in 
Figure 6, if the ISO-8859-1 character set Is in use, then the character that 
con-esponds to a F6 octet 610 is and "O" with an umlaut. If as shown In Figure 
7, however, the ISO-8859-7 character set is In use, then the character that 
comesponds to a F6 octet 710 Is the Greek letter phi. The charset parameter 
informs the receiving client which character set was used to create the body of 
the electronic mail message. Resultantly the charset parameter informs the 
receiving agent how to properly translate the octets to characters. Proper 
identification of this parameter is critical In the con-ect handling of the electronic 
mail message as received by the user of the system. 
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The last of the three fields is the Content-Transfer-Encoding field 580 
. When using character sets which require greater than 7 bits to 
represent the entire character set (e.g. 8 bit or multi octet), special processing 
may be required when an electronic mail message is transfenred. For example, 
if a transfer agent has a message with 8 bit data, and it cannot negotiate the 
transfer of this message as 8 bit, it will need to encode the message into 7 bit, 
short line format. When a user agent creates a electronic mail message, the 
user agent may perform the encoding at the outset. In the present 
embodiment, Figure 5 shows an example where an electronic mail message, 
which was written using the UTF-8 character set 585, is to be encoded into a 
message that will be in 7-bit, short line format using the "quoted-printable" 
encoding transformation. For further information regarding the Content- 
Transfer-Encoding syntax and semantics, refer to Network Working Group RFC 
2056 § 6. 

Recipient Expressed in the Preferred Language 

Another aspect of the present invention is automatically expressing the 
recipient of a generated email in a language dependent manner, i.e. in 
accordance with a user's preferred language. An area where using a 
customer's language may be an advantage is in the customer service arena. 
Typically, when requesting support via electronic mail, the user will be provided 
an electronic mail address to which an electronic mail message should be sent. 
An aspect of the present invention will customize the expression of a recipient 
of an electronic mail message based on the users specified language. 

In one embodiment, as shown in Figure 8, the user has accessed a 
website for customer support for Company X's widgets. From this generic start 
screen for the customer support website, the user will pick the country from 
which he is operating. The user may, in this embodiment, enter a help request 
directly to the appropriate person. This appropriate person is determined by 
specific customization that can occur based on the user's language selection. 

In one embodiment, after composing the text of a text window built from 
HTML, the user clicks on send. Upon dtcking on send, the javascript code will 
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dynamically create the electronic mail message. The code will do so using 
techniques similar to the ones previously discussed, to generate the header 
information based on the users language. However, there is an additional level 
of customization that will occur in this case. Figure 9 shows a code snippet for 
a routine whose Job it is to send the electronic mail message that was 
composed in Figure 8. Here we can see that the destination of the electronic 
mall message will be based on the $recip variable. This variable is set during a 
call to the routine get_recip. Figure 10 shows the get_recip routine. The 
language of the user Is passed to it. Based on this language value, we see that 
a language specific recipient electronic mail address is chosen. This method, 
vis-a-vis a single electronic mail addressee for all customer service requests, 
provides several advantages. First, the electronic mail will be sent to the 
proper recipient initially, without having to be routed though another primary 
customer service address. This can result in time saving because there may 
be a delay in processing due to differences in time zone. Second, if there is 
trouble with the primary customer service mail server, this may not affect the 
other mail servers, enable this dynamic electronic mail message to be sent. 
Another advantage is that generic recipient names, such as marketing or 
service organization names, and so forth, can be generated in the native 
language. 

Cultural Specific Dynamically Created Salutation 

Another customization that can be performed based on a user's 
language preference is the use of cultural appropriate salutations. Salutations 
vary from one culture to another (and hence, roughly from one language to 
another). For example in English, the preferred method of addressing 
someone In a professional letter would be "Ms./Mr./Mrs./Miss <firstname> 
<lastname>". In contrast. In Japanese a salutation is done with the family 
name first, followed by honorific title. It would be lnapprc>priate. for example, to 
use the English standard when addressing users from Japan. It is possible to 
use database and program logic to customize the salutations to users based on 
language. 
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In one embodiment, assume that a user's language has been 
determined and, via some program logic, has been placed In the variable 
Slang. Refer at this time to Figures 1 1 , 1 2 and 1 3. Figures 1 1 , 1 2 and 1 3 show 
an embodiment of the invention using Perl and SQL code for creating a 
culturally sensitive salutation. Refening to the Pert code of Figure 1 1 , we see 
that a call to a routine getjist_value 1101 is made. The getjist_value routine 
is shown in Figure 12. The routine is provided with a language identifier ($lang) 
and the identifier in the database for the list where the salutations are located 
(10) and the equivalent "salutation key" to be returned ($title - e.g. Mr. or Mrs.). 
With this information, a query is made into the database 1210 and a field is 
retumed 1220. The get_list_value routine then parses the database field 
returned by the query 1220 to get the appropriate salutation 1230. The 
salutation is then returned 1240. So to summarize, this routine only retums 
what the appropriate salutation is for a language, not what the format is. 
Retuming to Figure 1 1 , we see that this is where the salutation is created. A 
test is made for the language defined for the recipient of the message. 
Assuming that the language was Japanese, the salutation that would be 
retumed by the getjist_value query would be "san". Assuming further that the 
$lasbiame of the recipient of the message is Kojima. The appropriate 
salutation of "Kojima-san" would be created by the program logic shown in 
Figure 11. 

Application to Multi-Region Market Research Study 

A particular application of the above described language based 
generation of email Is in the area of multi-region market research study, 
involving regions where multi-language are spoken. Such maricet research 
studies, especially ones involving automatically generated emails fix)m the 
maricet research study service to the panelists, can benefit firom the present 
Invention. Conducting and processing multi-region maricet research studies is 
the subject of co-pending U.S. Patent Application number <to be assigned>, 
entitied Multi-Region Market Research Study Processing, filed 
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contemporaneously and having common inventorship with the present 
application. The application is hereby also fully incorporated by reference. 

Conclusion 

Thus, It can be seen from the above descriptions, a novel method of 
handling electronic mail message in a manner that is sensitive to the language 
of the user of a system is presented. While the present invention has been 
described in terms of the above-Hlustrated embodiments, those skilled in the art 
will recognize that the invention is not limited to the embodiments described. 
The present invention can be practiced with modification and alteration within 
the spirit and scope of the appended claims. The description is thus to be 
regarded as illustrative instead of restrictive on the present invention. 
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